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IMAGE PROCESSING 

This invention relates to image processing and in particular to a system and 
method for processing digital images to detect and extract physical entities or objects 
5 represented in the image. 

Image segmentation is of fundamental importance to many digital image- 
processing applications. The process of image segmentation refers to the grouping 
together of parts of an image that have similar image characteristics and this is often 
the first process in image processing tasks. For instance, in the field of video coding it 

10 is often desirable to decompose an image into an. assembly of its constituent object 
components prior to coding. This pre-processing step of image segmentation then 
allows individual objects to be coded separately. Hence, significant data compression 
can be achieved in vid^o ^sequences since slow moving background can be transmitted 
less frequently than faster moving objects. 

15 Image segmentation is also important in the field of image enhancement, 

particularly in medical imaging such as radiography. Image segmentation can be used 
to enhance detail contained in an image in order to improve the usefulness of the 
image. For instance, filtering methods based on segmentation have been developed 
for removing noise and random variations in intensity and contrast from captured 

20 digital images to enhance image detail and assist human visualisation and perception 
of the image contents. 

Other fields where image segmentation is important include multi-media 
applications such "as video indexing and post production content-based image retrieval 
and interpretation, that is to say video sequence retrieval based on user supplied 

25 content parameters and machine recognition and Interpretation of image contents 
based on such parameters. 

Fundamental to image segmentation is the detection of homogeneous regions 
and/or the t>oundaries of such regions which represent objects in that image. 
' Homogeneity may be detected in terms of intensity or texture, that is grey level values, 

30 motion (for video sequences), disparity (for stereoscopic images), colour, and/or focus 
for example. Many approaches to image segmentation have been attempted including 
texture-based, intensity-based, motion-based and focus-based segmentation. Known 
approaches require significant computational resources • and. often provide 
unsatisfactory results. 
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' One approach that uses Intensity or grey level values for object segmentation 
is thresholding. The concept of image segmentation based on thresholding is 
described in the paper "An Amplitude Segmentation Method Based on the Distribution 
Function of an Image", Compute. Vision, Graphics and Image Processing, 29. 47-59. 
5 1985. In the thresholding method intensity values are determined for each pixel or 
picture element in a digital image and on the basis of these values a threshold value is 
determined that distinguishes each pixel of an object in the image from pixels 
representing background detail. In practice, the threshold intensity value is determined 
dynamically for each image according to the statistical distribution of intensity values. 
10 that is to say, the value is based on a histogram analysis of all the intensity values for a 
particular image. Peaks in the histogram distribution generally represent intensity 
values predominately associated with a particular object. If two objects are present in 
an Image there will be pNO peaks. Jn these circumstances the intersection or overiap 
between the two peaks is taken as the threshold value. This approach to image 
15 segmentation is relatively straightforward but can be computationally intensive 
particularly when complex images are presented, for example, images comprising a 
number of objects or complex backgrounds or when the image is heavily "textured", 
that is to say. the image comprises a number separate regions within an object that 
have different intensity values. When textured images are processed using threshold- 
20 based methods "over-segmentation" can occur, that is, regions within an object are 
themselves recognised as separate objects within the image being processed. 

The problem of over segmentation can be partially overcome if the image is 
simplified prior to thresholding. Image simplification involves the removal of low order 
intensity value differences between adjacent pixels within an object boundary while the 
25 intensity value differences are maintained at the object boundaries. Image 
simplification is often achieved in digital Image processing by using so called non-linear 
diffusion methods. The concept of non-linear diffusion for image processing is 
described in the published paper "Scale Space and Edge Detection Using Anisotropic 
Diffusion", IEEE Trans, on Pattern Analysis and Machine Intelligence Vol, 12 No. 7 
30 pp629*639, July 1990. In this method pixel intensities are altered in a manner 
analogous to diffusion of physical matter to provide regions of homogenous intensity 
within object boundaries while preventing diffusion at the object boundaries, thereby 
preserving Intensity contrast at the boundaries. It has been found, however, that 
methods of image simplification based on known non-linear diffusion algorithms result 
35 in over segmentation. 
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According to a first aspect of the invention there is provided a method of 
processing an image recorded by an imaging device; said method comprising the 
steps of:- 

a) providing an image comprising an array of adjacent elements each 
5 corresponding to a respective part of the image and having a respective intensity value 

associated therewith; 

b) processing said intensity values to determine an intensity contrast value for 
each respective element according to differences in intensity values between 
respective adjacent elements; 

10 c) providing a depth value for each element corresponding to the distance 

between the imaging device and at least part of an image forming object represented 
in the image by the respective element; 

d) processing said depth values to determine a depth contrast value for each 
respective element according to differences in depth values between respective 

15 adjacent elements; and. 

e) processing said intensity contrast values and said depth contrast values to 
identify at least one area of the image corresponding to one or more respective objects 
in the image being processed: 

Thus, by processing depth contrast values with the intensity contrast values, 
20 data relating to the relief of an image, that is the depth of image forming objects (or 
more precisely the distance travelled by the reflected incident radiation) in the image, 
can be used to improve object boundary detection and thereby improve segmentation 
of the image Into its constituent object parts. By using two parameters instead of one 
the accuracy of determining object boundaries can be significantly improved. 
25 Preferably, said depth values are determined according to the spacing 

between corresponding points on a stereoscopic image pair. The spacing between 
corresponding points can be readily converted into depth values based on known 
imaging system geometry. Hence, additional image processing can be minimised. 

In a preferred embodiment, the spacing between said points is determined by 
■30 matching said corresponding points and estimating a vector value for the relative 
spacing and direction of said points. In this way, respective vector values can be used 
to represent the respective depth values associated with the respective elements. 

Conveniently, said area is determined by identifying an outline of said 
respective object or objects in the image. This readily provides for object identification. 
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Preferably, step e) comprises the step of altering the intensity values of each 
respective element in accordance with the respective intensity contrast value and the 
respective depth contrast value of the element. This can increase intensity differences 
between adjacent elements at the respective object boundaries. 
5 In preferred embodiments, the step of altering the intensity values comprises 

the step of modifying the respective intensity contrast values according to the 
respective depth contrast values, and altering the intensity values of the respective 
elements towards an average intensity value determined by the intensity values of 
surrounding elements if the respective modified intensity contrast value of the element 
10 is below a threshold value. This improves image simplification by reducing differences 
in intensity values between elements corresponding to positions within an object. 

Conveniently, the intensity contrast values are modified such that elements 
having a higher than, average, depjh contrast value have their respective intensity 
values altered less than elements having a lower than average depth contrast value. 
15 This increases the difference in the intensity values between adjacent elements 
con-esponding to positions on opposing sides of object boarders. 

Preferably, step e) comprises a non-linear diffusion process for altering 
element intensity values in accordance with respective intensity contrast values 
modified in accordance with respective depth contrast values. In this way, it is possible 
20 to improve known non-linear diffusion methods of image simplification by modifying the 
diffusion process in accordance with further object identifying data, that is to say using 
the depth data associated with each element. 

In a preferred embodiment, the method further comprises the step of de- 
lineating said object or objects from the image. This allows the objects to be stored. 
25 retrieved, coded, or processed separately, for example. 

Preferably, the delineating step comprises the steps oft- 
determining a statistical distribution function of the altered intensity values of 

the respective elements; 

detemiining a threshold value or range of values to include all the intensity 
30 values of the respective elements of at least one identified area in the image; and. 

selecting elements having modified intensity values within the threshold range 
or above or below said threshold value; and 

delineating image data relating to said selected elements from image data 
relating to the remaining elements. In this way it is possible to implement relatively 
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simple thresholding methods to extract the object or objects from the processed 
image. 

According to a second aspect of the invention there is provided a method of 
processing an image in accordance with a non-linear diffusion process; said method 
5 comprising the steps of:- 

i) providing an image comprising an array of adjacent elements each 
corresponding to a respective part of the Image and having a respective intensity value 
associated therewith: 

ii) processing said intensity values to determine an intensity contrast value for 
10 each respective element according to differences in intensity values between 

' respective adjacent elements; 

iii) identifying from said array of elements object defining elements 
corresponding to points pn. respective objects in the image; 

Iv) altering said element intensity values in accordance said respective 
15 intensity contrast values, whereby said elements are altered to a lesser or greater 
extent in dependence on whether said respective element Is an object element. 

In one embodiment, step iii) comprises the step of determining a depth value 
associated with a disparity field for each element in the image and identifying said 
object elements from said depth values. 
20 In another embodiment, step iii) comprises the step of determining a motion 

value associated with a motion vector in a video sequence for each element in the 
• image and identifying said object elements from said motion values. Accordingly, the 
^ non-linear diffusion process can be modified In accordance with object positions 
determined by motion recorded in a video sequence. 
25 According to a third aspect of the present invention there is provided an image 

processing system for processing an image recorded by an imaging device; said 
system comprising:- 

. a) a data receiver for receiving data relating to an image comprising an array 
of adjacent elements each corresponding to a respective part of the image and having 
30 a respective intensity value associated therewith; 

an intensity value processor configured to determine an intensity contrast 
value for each respective element according to differences' in intensity values between 
respective adjacent elements; 
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' a depth value processor configured to determine a depth value for each 
element corresponding to the distance between the imaging device and at least part of 
an image forming object represented in the image by the respective element; 

a depth contrast value processor configured to determine a depth contrast 
5 value for each respective element according to differences in depth values between 
respective adjacent elements; and, 

an object segment processor configured to process intensity contrast values 
and said depth contrast values to identify at least one area of the image corresponding 
to one or more respective objects in the image being processed. 
10 The invention will now be described, by way of example only, with reference to 

the accompanying drawings; in which;- 

Rgure 1 is a schematic block diagram of a system for processing digital 

images; r .t ~ _ 

Figure 2a shows a pair of stereoscopic images of a scene viewed from two 

1 5 different perspectives with a stereoscopic imaging device; 

Figure 2b shows the images of Rgure 2 in side by side relation; 
Figure 3 is a schematic block diagram of an image processor for processing 
digital images in the system of Figure 1 ; 

Figure 4 is a flow chart of a method for processing digital images 
20 Figure 5a is a pre-processed image of a scene comprising an object to be 

segmented; 

Figure 5b is a processed image of the image of Figure 5a processed in 
accordance with~a known non-linear diffusion process; 

Figure 5c is a processed image of the image of Figure 5a showing disparity or 
25 depth vectors for the image of Figure 5a obtained from a stereoscopic image pair; 

Figure 5d is a processed image of the image of Figure 5a processed in 
accordance with a modified non-linear diffusion process utilising the disparity data 
represented in Figure 5c; and. 

Figure 5e is shows an object mask extracted from the processed image of 

30 Figure 5a. 

With reference to Figure 1, in one arrangement of the present invention an 
image processor 102 is arranged to receive digital images from a memory 104 storing 
two dimensional images of three dimensional scenes recorded by means of an optical- 
electronic imaging device 106. The imaging device 106 receives electromagnetic 
35 radiation from all areas of the scene being recorded including one or more distinct 
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image forming objects 108 within the imaging device's field of view 110. The imaging 
device can be any device capable of forming optical-electronic images, including for 
example an array of light sensitive photo-diodes or the like connected to respective 
charged coupled devices for forming a digital image of picture elements or pixels 
5 capable of being stored in electronic digital memory 104. The pixels each have a grey 
level value associated with them representative of the brightness orHntensity of the 
respective part of the scene they represent. Data relating to the colour associated with 
each pixel may also be stored in the memory 104. 

In the present arrangement the imaging device comprises two separate 

10 optical-electronic imaging systems for recording stereoscopic image pairs. Figure 2a 
shows a pair of images, 200 to the left.of the drawing and 202 to the right, that define a 
stereoscopic image pair corresponding to two different perspective projections in 
slightly different plants .of jhe same scene. The image processor 102 is programmed 
in a known manner to process stereoscopic image pairs of the type shown to obtain 

15 data relating to the depth of the or each object and the background in a scene, or more 
precisely, the distance travelled by the incident electromagnetic radiation reflected by 
the or each object or background to the respective light sensitive pixels of the imaging 
device. The image processor is programmed to determine disparity vectors in much 
the same way that conventional image processes are programmed to determine 

20 motion vectors for object segmentation prior to video sequence coding. For instance, 
depth is estimated from the stereoscopic images by estimating a disparity vector for 
each pair of corresponding points in the image pair. In Figure 2 a point 204 on an 
object in a scene".has a position defined by the spatial co-ordinates (x,y,z). This point is 
projected on the left image at a. point 206 having the local spatial rco-ordlnates (x,y)l 

25 and likewise on the right image at a point 208 having the spatial co-ordinates (x.y)r. 
The left and right images have the same co-ordinate reference frame and so the 
distance and direction between the two corresponding points 206 and 208, known as 
the disparity vector, can be readily determined. 

Rgure 2b shows the two images 200 and 202 in side by side relation. The 

30 disparity vector 210 for corresponding points 206 and 208 is shown on the right hand , 
image 202. The vector extends between the projected point 206 of image 200 and 
point 208 on image 202. • 

It is possible to determine the distance of a point in an image from the 
disparity vector for that point based on knowledge of the imaging system geometry. 

35 The estimation of depth in an image using stereoscopic imaging is described in detail* 
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in the paper "Depth Based Segmentation" IEEE Transaction on Circuits and Systems 
for Video Technology. 7(1 ), February 1 997, pp237-239. 

in the arrangement of Figure 3, the image processor 102 comprises a data 
receiving interface 302 for receiving data defining stereoscopic image pairs of a scene 
5 or sequence of scenes from the memory 104, The data-receiving interface is 
connected to a first processor 304 which is programmed to detemnlne an intensity 
contrast value for each of the pixels in one or both stereoscopic images. The intensity 
contrast value is the iritensity or grey level gradient at the respective pixel detennlned 
by the local variation in intensity in the adjacent pixels. The receiving interface is also 
10 connected to a second processor 306 which includes a first module 308 programmed 
to determine the disparity vector associated with each pixel and a second module 310 
programmed to determine a disparity or depth contrast value for each pixel. The 
disparity or depth <3>ntrast value is the disparity or depth value gradient at the 
respective pixel determined by thelocal variation in depth values associated with the 
15 adjacent pixels. The first 304 and second 306 processors are connected to a third 
processor 312 which is programmed to process the image in accordance with a non- 
linear diffusion process based on the intensity contrast and depth contrast values 
determined by the respective first and second processors. A fourth processor 314 is 
connected to the third processor 312 for processing the image data simplified by the 
20 processor 312 to delineate and extract groups of neighbouring pixels representing 
physically meaning entities or objects contained within the image being processed. 

The image processor of Figure 3 is programmed to segment an image by first 
simplifying the image and then extract objects from the image by histogram based 
threshold analysis and extraction. An example of an image segmentation method will 
25 now be described with reference to the flowchart of Figure 4. 

Data defining a pair of stereoscopic images of a scene or a sequence of 
images pairs constituting a video sequence are read from memory 104 by the interface 
302 of the image processor 102 in step 400. The image data is stored in the memory 
104 as a set of grey level values, one for each pixel. In step 402 the grey level values 
30 are processed by the processor 304 to determine the local variation in intensity in the 
region of each respective pixel to determine a respective contrast value for each of the 
pixels. Subsequently or simultaneously, image data defining an image pair is 
processed by the processor 306. first in step 404 by processor 308 to determine 
respective disparity vectors 210, and second in step 406 to determine respective depth 
35 contrast values based on the local variation in disparity vector values in the region of 
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. each respective pixel. Step 404 can be based on the method disclosed in the paper 
"Depth Based Segmentation" IEEE Transaction on Circuits and Systems for Video 
Technology, 7(1), February 1997, pp237-239, the contents of which are incorporated 
herein by reference. 

5' The image is simplified in step 408 by processor 312 according to a data 

dependent non-linear diffusion process. Step 408 involves altering the respective pixel 
intensity values by modifying -the respective intensity contrast values according the 
' corresponding depth contrast values determined in steps 402 and 406 respectively. 
The intensity values are altered towards an average intensity value determined by the 

1 0 intensity values of the respective surrounding pixels if the modified respective contrast 
: value for the pixel is below a certain value. In this regard, the intensity contrast values 
are modified such that pixels having a higher than average depth contrast value have 
their respective intensity values altered less than elements having a lower than 
average depth contrast value. Since step 408 is analogous to a physical diffusion 

15 process the step is iterative and repeats until a pre-determined equilibrium is achieved. 
The process of step 408 ultimately provides an image where the intensity values tend 
to an equilibrium value within the region corresponding to an object within the image, 
that is to say the or each object is represented by a separate homogeneous region of 
intensity. The diffusion process is considerably reduced in regions corresponding to 

20 object boundaries so that there is significant contrast in intensity between objects and 
objects and between objects and background within an image of a scene. An example 
of the process of step 408 is described in greater detail in. the example described 
below. 

In step 410 the processed image data of the simplified image is processed by 
25 the processor 314 to determine an image segmentation grey level threshold value for 
image segmentation. In step 412 one or more objects are extracted from the image 
according to the modified intensity values of the respective pixels. Steps 410 and 412 
may be implemented in accordance with the histogram based segmentation method 
described in the paper "An Amplitude Segmentation Method Based on tiie Distribution 
30 Function of an Image", Compute, Vision, Graphics and Image Processing, 29, 47-59, 
1 985 mentioned above. 

In the method described with reference to Figure 4. an image contairiing one 
or more structurally meaningful entities or objects is first simplified, that is to say the 
image is processed to remove inconsequential detail from the image, and then 
35 segmented into regions containing respective entities or objects. In the example 
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described, image simplification is based on a modified non-linear diffusion process 
involving grey level intensity values of respective picture elements or pixels comprising 
the image. The process of step 408 will now be described with reference to the 
following mathematical example. 

Example 

Mathematically the process of diffusion can be described , by the following 
partial differential equation, known as the diffusion equation: 
/, =i/iv(r-V/) (1) 
Equation (1) embodies two important properties: first, the equilibration property stated 
by Pick's law. ^J = -r- V/ , where V/ is the concentration gradient. ^ is the flux, and 
r is the diffusion tensor; and second, the continuity property given by /, =-f/iv(^). 
Thus the concentration /, is equal to the -ve flux divergent. 

In the context of the present invention the concentration /, or /(jc.y.f) is 
identified as the intensity (grey level value) at any spatial sampling position /(.t,y)of 
the evolved image at a time t. 

If the diffusion tensor r is constant over the whole image, then Equation (1) describes 
a linear diffusion model. 

/, =cV-/ (2) 

where c is the diffusion constant and V^/ the Laplacian of the image intensity. 

If the diffusion tensor r in Equation (1) is defined as a function of the local 
energy variation, that is the local image intensity (or grey level value) gradient, at an 
image position (x. y). r = /(x. y.r) . a diffusity function, then Equation (1) leads to. 



25 



/, =V.[/(.t.y.f)V/l = Jiv(/(x.y.OV/) (3) 

Equation (3) defines a non-linear diffusion process in which local averaging of 
grey level values is inhibited in regions of object boundaries and diffusion velocity is 
controlled by the local Intensity (or grey level value) gradient. Local averaging is the 
process of averaging the grey level values of adjacent pixels and replacing the current 
30 grey level value of a pixel with this average value. 
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If the diffusity function f(.) is chosen as a continuously decreasing function of 
the image gradient, the diffusion process approximates to a constant solution, or 
equilibrium, representing a simplified image with sharp boundaries. The amount of 
diffusion in each pixel or image point is modulated by a function of the image gradient 
5 at that point. Accordingly, image regions of high Intensity contrast undergo less 
diffusion, whereas uniform regions are diffused considerably. 

Equation (3) may be combined together with a rapidly decreasing diffusivity 

function: 

10 and this diffusivity function leads to a flux function ^ of the fomn: 

■ >>(iiv/f)=Aiv/f)iv/||= ^^^j^^^^ : - (5) 

Where K is a threshold value. 

Thus the derivative of equation (5) is positive for ljV/|| < K and negative for 
]|V/|| > K . Consequently the diffusion process behaves in a forward parabolic manner 

15 for ||V/|| < K , while it behaves in a backward parabolic manner for 1|V/| > K . That is, 
Equation (5) presents a contrasting behaviour according to the magnitude of the image 
intensity gradient. It will sharpen edges with a local gradient greater than K, while 
smoothing edges with gradient magnitude lower than K. The value of K can be 
determined experimentally. Figures 5a and 5b show respective pre and post processed 

20 images where the image has been processed using the above-defined non-linear 
diffusion mathematical model. 

The above model is improved by using the disparity values associated with 
the respective pixels since these values vary considerably at object borders. In 
addition, the accuracy of disparity or depth estimation can be substantially increased at 
25 the object borders given the known object outline from the intensity contrast values. 

(n one example of the present invention the disparity values are used to control 
the diffusion when non-linear diffusion is applied. Rgure 5c shows the distribution of 
disparity values for the a stereoscopic image pajr corresponding to the image of Figure 



wo 01/43071 



12 



PCT/CBOO/04707 



5a. In this representation only the horizontal component of the respective disparity 
vectors is shown. The magnitude of the vector is represented by grey values. As 
shown, the approximate position of the object boundaries coincide with the image 
regions where the disparity variation is high. Thus, by analysing the local variation of 
5 the disparity vectors it is possible to detect the position of the respective object 
borders. 

The degree of smoothness g{z) of the disparity vectors at any sampling 
position 2 = (jc, y) . 500 in Figure 5c. is obtained by measuring the statistical variance 
of the disparity vectors inside a small observation window 502 centered at position 
10 500. The size of the window 502 is for example 8x8 pixels. The smoothness can be 
expressed as: 

5-(c)=V<y;+<Ty . ' - ^ - (6)- 

where a; and cr; are. respectively, the variances of the horizontal and vertical 
components of the disparity vectors inside the window 502. 
15 The diffusivity f(.) in Equation (4) is now defined as function of a ^ -weighted 

image gradient Jv/|^^. That is, at each sampling position zthe magnitude of the image 
gradient is weighted by its local disparity variance g{z). So if ^^ax is the maximum 
variance of the considered disparity field and ^ : [0, ^^ax 1 ^0,1] . that is any 
increasing control function satisfying the two conditions g(0)=0 and g( )=h then: 

20 ||v/|^=^(«:(^))||v/l'. (7) 

There are several choices for the control function g. For example a suitable 
family of functions is given by: 

[ 1 otherwise 
where C s (0, 1) is a threshold modulating the influence of g in the diffusion 

25 process. 
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Applying the parabolic diffusion Equation (3) with diffusivity function 
/(|V/(.r, v./)|^) an iterative disparity-driven or depth-driven diffusion process model 
is defined. 

Rgure 5d shows a simplified image of Figure 5a when disparity-controlled 
5 ^ diffusion is applied according to the above mathematical model. 

It can be seen from Figure 5d that the above-described disparity-driven non- 
linear diffusion model is particularly appropriate for both object segmentation and 
pattern recognition image processing. Masks of complete physical objects can easily 
be extracted from the processed images using known histogram-based thresholding 
10 methods. An example of an extracted mask of the image of Figure 5a is shown in 
. Figure 5e. ^ . ; ^ . . 

Although the present invention has been described with reference to 
stereoscopic disparity-driven non-linear diffusion it will be understood that other 
embodiments of the present invention could be readily implemented by the person 

15 skilled in the art without further inventive contribution. For example, the depth values 
- could instead be obtained by using an active imaging device comprising a low power 
laser range finder to simultaneously obtain depth information relevant to respective 
pixels in an image or image sequence. In addition the data-driven aspects of the above 
described non-linear diffusion process could be readily implemented for video 

20 sequences using_ motion values instead of the disparity values and determined in a 
similar way as the disparity values but using monoscopic sequential frames of a video 
sequence instead of stereoscopic image pairs, for example. 
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CLAIMS 



1. An method of processing an image recorded by an imaging device; said 
method comprising the steps of :- 
5 a) providing an image comprising an array of adjacent elements each 

corresponding to a respective part of the image and having a respective intensity value 
associated therewith; 

b) processing said intensity values to determine an intensity contrast value for 
each respective element according to differences in intensity values between 

10 respective adjacent elements; 

c) providing a depth value for each element corresponding to the distance 
between the imaging device and at least part of an image forming object represented 
in the image by the re^sptective element; 

d) processing said depth values to determine a depth contrast value for each 
15 respective element according to differences in depth values between respective 

adjacent elements; and, 

e) processing said intensity contrast values and said depth contrast values to 
identify at least one area of the image con-esponding to one or more respective objects 
in the image bein^ processed, 

20 

2. A method according to claim 1 wherein said depth values are determined 
according to the spacing between corresponding points on a stereoscopic image pair. 

3. A method according to claim 2 wherein the spacing between said points is 
25 determined by matching said corresponding points and estimating a vector value for 

the relative spacing and direction of said points. 

4. A method according to any preceding claim wherein said area is determined 
by identifying an ouUine of said respective object or objects in the image. 



30 



5. A method according to any preceding claim wherein step e) comprises the 
step of altering the intensity values of each respective element in accordance with the 
respective intensity contrast value and the respective depth contrast value of the 
element. 
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6. ' A method according to daim 5 wherein the step of altering the intensity values 
comprises the step of modifying the respective intensity contrast values according to 
the respective depth contrast values, and altering the intensity values of the respective 
elements towards an average intensity value deteiinined by intensity values of 

5 surrounding elements if the respective modified intensity contrast value of the element 
is below a threshold value. 

7. A method according to claim 6 wherein the intensity contrast values are 
- modified such that elements having a higher than average depth contrast value have 

10 their respective intensity values altered less* than elements having -a lower than 
average depth contrast value. 

8. A method according to any preceding claim wherein step e) comprises a non- 
linear diffusion process for altering- element intensity vailues in accordance with 

15 respective intensity contrast values modified in accordance with respective depth 
contrast values. 

9. A method according to any one of claims 5 to 8 further comprising the step of 
de-lineating said object or objects from the image. 

20 

10. A method according to claim 9 wherein the delineating step comprises the 
steps of:- 

determirring a distribution of the altered intensity values of the respective 
elements; 

25 * determining a threshold value or range of values to include all the intensity 

values of the respective elements of at least one identified area in the Image; and, 

selecting elements having modified intensity values within the threshold range 
or above or below said threshold value; and 

delineating image data relating to said selected elements, from image data 
30 relating to the remaining elements. 

11. An method of processing an image in accordance with a non-linear diffusion 
process; said method comprising the steps of:- 
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' a) providing an image comprising an array of adjacent elements each 
corresponding to a respective part of the image and having a respective intensity value 
associated therewith; 

b) processing said intensity values to determine an intensity contrast value for 
5 each respective element according to differences in intensity values between 

respective adjacent elements; 

c) identifying from said array of elements object defining elements 
corresponding to points on respective objects in the image; 

d) altering said element intensity values in accordance said respective 
10 intensity contrast values, whereby said elements are altered to a lesser or greater 

extent in dependence on whether said respective element is an object element. 

12. A method ajccprding to claim 11 wherein step c) comprises the step of 
determining a depth value associated with a disparity field for each element in the 

15 image and identifying said object elements from said depth values. 

13. A method according to claim 11 wherein step c) comprises the step of 
determining a motion value associated with a motion vector in a video sequence for 
each element in the image and identifying said object elements from said motion 

20 values. 

14. A system configured to implement a method according to any preceding claim 

1 5. An image processing system for processing an image recorded by an imaging 
25 device; said system comprising:- 

a) a data receiver for receiving data relating to an image comprising an array 
of adjacent elements each corresponding to a respective part of the image and having 
a respective intensity value associated therewith; 

an intensity value processor configured to determine an intensity contrast 
30 value for each respective element according to differences in intensity values between 
respective adjacent elements; 

a depth value processor configured to determine a depth value for each 
element corresponding to the distance between the imaging device and at least part of 
an image forming object represented in the image by the respective element; 



wo 01/43071 



17 



PCT/CBOO/04707 



a depth contrast value processor configured to determine a depth contrast 
value for each respective element according to differences in depth values between 
respective adjacent elements; and, 

an object segment processor configured to process intensity contrast values 
5 and said depth contrast values to identify at least one area of the image corresponding 
to one or more respective objects in the image being processed. 
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